A low bit rate speech coding method using a formant-articulatory parameter nomogram

Authors

  • Hiroshi Ohmura
  • Akira Sasou
  • Kazuyo Tanaka
Abstract

In this paper, we propose a new method for low bit rate speech coding that uses a nomogram: a pair of codebooks representing the functional relationship between formant frequencies and articulatory parameters. Two features distinguish our approach: 1) the codebooks are derived theoretically, by computation with a stylized vocal tract model, and 2) frequency information is separated from amplitude information in a speech segment, and the two are coded independently. Because of these features, the method depends little on speech databases or languages in the acoustic domain, so it has the potential to support a more flexible rule-based speech synthesis system. We conducted articulatory encode-decode experiments at bit rates from 3.2 kbps down to 1.6 kbps using speech samples from the ASJ and TIMIT speech databases, and confirmed that good-quality speech synthesis is achieved with improvements to the bit allocation scheme and the frame sampling method.
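The idea of a nomogram as a pair of index-aligned codebooks can be illustrated with a minimal sketch. This is not the authors' implementation. The codebook values, the log-frequency distance, the function names, and the 50 frames-per-second figure are all hypothetical placeholders chosen for illustration; the abstract does not specify the codebook contents, the distance measure, or the frame rate.

```python
import math

# Hypothetical toy nomogram: row i of FORMANTS pairs with row i of ARTICULATORY.
# All numbers are made-up placeholders, not values from the paper.
FORMANTS = [            # (F1, F2, F3) in Hz
    (300.0, 2300.0, 3000.0),
    (700.0, 1200.0, 2500.0),
    (350.0, 800.0, 2300.0),
    (500.0, 1500.0, 2500.0),
]
ARTICULATORY = [        # two abstract articulatory parameters per entry
    (0.2, 0.9),
    (0.9, 0.4),
    (0.3, 0.1),
    (0.5, 0.5),
]


def encode(formants):
    """Return the index of the nearest formant entry (assumed log-frequency distance)."""
    def dist(entry):
        return sum((math.log(a) - math.log(b)) ** 2 for a, b in zip(formants, entry))
    return min(range(len(FORMANTS)), key=lambda i: dist(FORMANTS[i]))


def decode(index):
    """Look up the articulatory parameters paired with a formant index."""
    return ARTICULATORY[index]


def bits_per_index(codebook_size):
    """Bits needed to transmit one codebook index."""
    return math.ceil(math.log2(codebook_size))


def bits_per_frame(bit_rate_bps, frames_per_second):
    """Bit budget per frame for a given bit rate and (assumed) frame rate."""
    return bit_rate_bps / frames_per_second


if __name__ == "__main__":
    idx = encode((680.0, 1250.0, 2450.0))
    print(idx, decode(idx), bits_per_index(len(FORMANTS)))
    # Assumed 50 frames/s, purely for illustration.
    print(bits_per_frame(1600, 50), bits_per_frame(3200, 50))
```

Only the index travels over the channel in this sketch, so the cost per coded vector grows with the logarithm of the codebook size. Under the assumed frame rate, halving the bit rate from 3.2 kbps to 1.6 kbps halves the per-frame budget, which is why the bit allocation scheme and frame sampling matter at the low end.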

Similar resources

Segmental feature extraction and coding for speech synthesis

This paper describes a segmental feature extraction and speech coding method in an acoustic-articulatory domain using nomograms that represent a mapping between formant frequencies and articulatory parameters. The vocal tract model is a modified Fant model, in which we newly introduced a parameter for successively adjusting vocal tract lengths. We investigated first the relationship between form...

A hybrid time-frequency domain articulatory speech synthesizer

High quality speech at low bit rates (e.g., 2400 bits/s) is one of the important objectives of current speech research. As part of long range activity on this problem, we have developed an efficient computer program that will serve as a tool for investigating whether articulatory speech synthesis may achieve this low bit rate. At a sampling frequency of 8 kHz, the most comprehensive version of ...

Estimation of articulatory parameter trajectory from speech acoustic dynamics

This research aims to perform articulatory analysis as a basis for low bit-rate speech coding. The classical approach consists of gathering a large set of acoustic and articulatory vector pairs in a codebook. Then, based on some criteria, the non-uniqueness of the articulatory trajectories is resolved using a dynamic optimization procedure. An articulatory codebook requires a model capable of gen...

Articulatory analysis using a codebook for articulatory based low bit-rate speech coding

Fundamental to the success of articulatory-based speech coding is the mapping from acoustics to an articulatory description. As the mapping is not unique and based on articulatory continuity criteria, the non-uniqueness of the articulatory trajectories is resolved using a forward dynamic network. In this paper, we present new results on the forward dynamic network used to estimate articulatory traje...

Publication date: 2000